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Abstract. We introduce the epidemic quasimetric on graphs and study its behavior with 
respect to clustering techniques. In particular we compare its behavior to known objects 
such as the graph distance, effective resistance, and modulus of path families. 



1. Introduction 

Consider a finite contact network and assume, at first, that a single individual is infected. 
Then all of its neighbors become potentially suceptible, and then all of its neighbors' neigh- 
bors, etc... The potential damage of the epidemic spreads as a circular wave. We use this 
dynamic to assign to every pair of nodes of a finite graph a number, which we call "epidemic 
distance". To compute the epidemic distance between a and b we expand the range of an 
epidemic started at a until b is affected and compute the number of nodes that became 
suceptible in the process. Then we do the same interchanging a and b, and we add the two 
numbers thus obtained. Epidemic distance is fairly easy to compute numerically and we 
describe the routine we implemented in Matlab in Section |5| 

Our goal is to compare the notion of epidemic distance to the more classical notion of 
effective conductance, when the graph is viewed as an electrical network. Effective conduc- 
tance has been used in the literature to study graphs from the point of view of community 
detection and sparsification. Therefore, such a comparison gives us hope that epidemic dis- 
tance can also be used effectively to study graphs while being relatively simple to compute 
numerically. 

The paper begins by reviewing the theory of random walks on finite graphs in Section [3} 
its connection to electrical networks, and the notion of effective conductance in Section |4j 

Then, in Section |6| we introduce two more concepts drawn from modern geometric function 
theory. Namely the notions of capacity and modulus of families of paths. In Theorem 6.4 we 



show that all of these concepts coincide with the notion of effective conductance. Moreover, 



in Proposition 6.2 we exploit the definition of modulus to obtain a comparison between 
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modulus and epidemic distance. Thus we get a comparison between epidemic distance and 
effective conductance. 

Finally, in Section [7| we experiment with the epidemic distance and show how it can 
be used to obtain a pretty accurate cut of the famous Karate Club graph into "natural" 
communities. 

We begin with some preliminaries on elementary graph theory. 

2. Graphs 

2.1. Notation and generalities. We will restrict our study to simple, finite, connected 
graphs. Let G = {V, E) be a graph with vertex-set V and edge-set E. We say that G is 
simple if there is at most one undirected edge between any two distinct vertices, and it is finite 
if the vertex set has cardinality |y| = G N. In this case, the edge-set E can be thought of 
as a subset of (^) , the set of all unordered pairs from V. Therefore the cardinality of E is 
M =\E\< (^) = iN/2){N - 1). 

We say that two vertices x,y are neighbors and write x ~ ?/ if {x,y} G E. The graph is 
connected if for any two vertices a,h eV there is a chain of vertices = a, xi, . . . , x„ = 6, 
so that Xj ~ Xj-t-i for j = 0, . . . ,n — 1. It is known that connected graphs must satisfy 
\E\>N -I (induction). 

Given a subset of vertices V C V , we let E{y') d E he all the edges of G that connect 
pairs of vertices in V . With this notation G{y') = {V , E{V')) is a simple graph which we 
call the subgraph induced by V. More generally, a subgraph of G is a graph G' = {V',E') 
such that V CV and E' C E{V'). 

The number of edges that are incident at a vertex x is called the degree of x and we write 
d{x). Since every edge is incident at two distinct vertices, it contributes to two degrees. 
Therefore 

^d(x) = 2\E\. 

x&V 

This identity is sometimes referred to as the "Handshake Lemma". It says that instead of 
counting edges, one can add degrees, i.e., switch to d{x) which is a function defined on V . 

For instance, the volume of a subgraph H = {V{H), E{H)) of G can be defined as \E{H)\, 
i.e., the number of edges of iJ; or as half the sum of the i7-degrees over the vertices of H: 

^ dnix). 

xev{H) 

We say that 7 is a path in G if 7 is a connected subgraph of G. 

The graph G is weighted if there is a weight function W : E ^ [0, +00) defined on the 
edges. The unweighted graph is recovered by setting W{e) = 1, Ve G E. 
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Given a path 7 in G, it is natural to define its graph-length to be the total weight of the 
edges of 7, i.e., 

(2.1) Lengthc(7) := W{e). 

eeS{7) 

2.2. The graph distance. A function of two variables d{x, > on a space X is called 
a metric or a distance, if it is symmetric, d{x,y) = d{y,x); non-degenerate, d{x,y) = iff 
X = y; and satisfies the triangle inequality, d{x,y) < d{x,z) + d{z,y), whenever x,y,z G X. 

On a connected graph G the graph distance dc{x, y) is defined as the shortest graph-length 
of a path connecting x to y. 

(2.2) dG{x,y) ■■= min LengthG(7), 

where j : x y means that 7 is a path connecting x to y. We leave the verification that do 
is a metric to the reader. 

The diameter of the graph G is 

Diam(G') = max dG{x,y). 

X,yS:V 

In the next two sections we survey Markov chains and electrical networks. Most of this 
material can be found in |LPW2009j . |G2010j . |DS1984j . 

3. Markov Chains 

A Markov chain is comprised of a finite set S (the state space) and a probability distribu- 
tion on the state space which can be represented in terms of a transition matrix {P{i,j)} = P 
whose entries correspond to the probability of being at state j at time n + 1 given that you 
were at state i at time n. P being a transition matrix means that we must have P{i, j) = 1 
(Row sums add up to 1.) 

This determines a process, i.e., a sequence of random variables Xn = Q S, with the 
property that P(X„+i = y\Xn = x) = p{x,y). Note that P{x,y) is independent of n. 

Example 3.1. A random walk on a graph is an example of a Markov chain. Consider a 
finite, simple graph G = {V,E) with vertices V = {1,2,...,A^} and edges E. For x E V, 
dx indicates the degree or local index of x. We've seen that, by the "hand-shake" lemma. 

Define 

Note that P('I, v) = E„~., = (V'iJ T.„~, 1 = 1- 
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3.1. Matrices act on column vectors. Given N states, {P{x,y)} — P is an. N x N 
matrix. Here N x 1 vectors correspond to functions / : 5" — > R on the state space. Then P 
acts on functions as follows: 

g{x) = {pf){x) = J2Pi^^y)fiy)- 

yes 

Probabilistically, 

y&S 

which is the average or expected value of / evaluated on the process at time 1. 
On a graph: 

So g is obtained from / by defining g{x) to be the average of the values of / over all the 
neighbors of node x. 

Definition 3.2. Whenever x and (P/)(x) — f{x) we say that / is a harmonic function 
on the graph at x. 

Example 3.3 (Gambler's Ruin). Consider six nodes {0, 1, 2, 3, 4, 5} representing the dollar 
amounts held by a gambler. After each bet the gambler either wins or loses a dollar with equal 
probability. The gambler will walk away whenever his fortune is (ruin) , or 5 (predetermined 
goal). We represent this with a transition matrix P, which is a 6 x 6 matrix in this case. 

We would like to know the probability of reaching 5 before 0, assuming that we start our 
random walk at node 3. This is an example of a "hitting probability", what in complex 
analysis would be called a "Harmonic Measure" problem. 
Let 

h{x) = ¥{Xn hits 5 before 0\Xo = x) = P^(X„ hits 5 before 0), 
where P^; is probability conditioned on {Xq = x}. 

We want to compute h{3). We call B = {0, 5} boundary points while / = {1, 2, 3, 4} arc called 
interior points. For x G /, we can condition on the first step; this is known as "first-step 
analysis". For brevity we write A := {X^ hits 5 before 0}: 

h{x) = P,(A) 

= ¥,{A\Xi = X - l)P,(Xi = X - 1) + F,{A\X, ^x + l)P,(Xi = X + 1) 

= lhix-l) + h{x + l) 

since P{x, y) = l/2if|/ = a; — lorx + l, and otherwise. The function h is thus harmonic 
for P at each interior point. Also h has boundary values h{0) — and h{5) = 1. 
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Fact 3.4 (Maximum Principle). A harmonic function achieves its maximum value M and 
minimum value m on the boundary. 

Idea: suppose / attains its maximum at an interior point. By harmonicity it must attain 
it at each neighboring vertex too and this will propagate out as an oil spill all the way to 
the boundary. 

Claim 3.5. The problem of finding a harmonic function h on I with boundary values h{0) = 
and h{5) = 1 has a unique solution. 

Proof. Suppose that h and g are both harmonic at each interior point and that h{0) = g{0) = 
0, h{5) = g{5) = 1. Let f = h — g. Then, by linearity, we have that 

fix) = ^f{x-l) + lf{x + l) 

with /(O) = and /(5) = 0. We get that M = max{f) = and m = min{f) = and the 
Maximum Principle implies that f = ot g = h. □ 

Remark 3.6. The example of Gambler's Ruin above is part of a larger set of problems. Given 
a subset B of nodes that we will call "boundary" and a function Hb defined only on B, it is 
always possible to extend this function on the remaining nodes (the interior points) so that 



the extension h is harmonic on the interior points. Claim |3.5| shows that if a solution exists 
it is unique. However, the existence is obtained by writing down a solution explicitly. For 
this we must introduce the notion of stopping times. Given a walker starting at some interior 
node X the stopping time tb is the first time the walker visits a node in B. Since the walk is 
a random process, stopping times are random variables. The solution to the boundary- value 
(or Dirichlet) problem is 

h{x) := E^ihBiXr^)) 

namely, the expected value of Hb evaluated at the exit point of a walk started at x. 

3.2. Matrices act on row vectors. On a finite graph a random walker either runs until it 
hits a given set of boundary points, as in the Gambler's ruin example, or it bounces around 
forever. In the latter case, we can ask what fraction of time does it spend at a node x7 

Intuitively we want to define vr to be the stable distribution if ^^.gv-^l^) — ^ ^(^) 
denotes the probability of finding the random walker at x in the long run. This however 
already implies that the long run stabilizes. 

Let fio{x) denote the initial distribution for Xq and similarily /ii(x) the distribution of Xi, 
after one 'step'. Then, conditioning on the previous location we get: 

f^ii^) = ^l^oiy)^iXi = x\Xo = y) = ^iiQ{y)P{y,x). 
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If we think of fio{x) as a row vector then we can rewrite this in matrix multiphcation form 
as fii = fJ^oP, = fJ'iP = fJ'oP^, ■ ■ ■ ■ The entries of P^ will be of the form 

{P'){x,y) = 5^P(x,^)P(^,i/) = P(X2 = y\Xo = x). 

z 

In general, we have that fin = /^o-P", where P" is the nth-power of the matrix P. 
If, as n goes to infinity, lim„_5.ooyUn = tt, then vr is a unique fixed-point for P. That's 
because for large n 

71 ^ Hn+1 = fJ^nP ~ TtP 

We say that vr is a stable distribution for P if ttP = vr. 

For a Markov chain on a finite state space there are very mild conditions (irreducibility 
and aperiodicity) that guarantee the existence and uniqueness of a stable distribution tt, as 
well as the convergence lim^^oo A'-o-P" = independently of /io- In such cases, 7r(x) > at 
every x E S and l/7r(x) equals the expected return time to x (in formulas E,x{t^)). Namely, 
TT is inversely proportional to the average amount of time it takes for the random walker to 
find its way back to the starting node. The easier it is to come back, the larger vr is, and the 
larger the proportion of time the random walker spends at x in the long run. 

3.3. Reversible chains. The chains we will use in the sequel will have one extra property: 
reversibility. 

Definition 3.7. The Markov chain P is reversible if there is a distribution tt such that 
(3.2) n{x)P{x, y) = n{y)Piy, x) Vx, y E V. 



If P is reversible, then the distribution vr is stable, as the following shows: 
7t{x)P{x, y) = Y^ T^{y)P{y, x) = niy) P{y, x) = 7T{y). 

The reversibility condition is very closely related to the notion of "symmetric" or "self- 
adjoint" matrices. In fact, if we let A{x,y) = 'K{x)P{x,y), then (3.2) is A{x,y) = A{y,x). 
In matrix notation A = DP, where D is a diagonal matrix with entries tt{x). So standard 
results in linear algebra tell us that A and thus P, are linearizable to a diagonal matrix of 
real eigenvalues. Moreover, the action of P on row vectors is the action of the adjoint (or 
transpose) P^ on column vectors and P^ shares the same eingenvalues as P and A. The 
difference lies in the eigenvectors (or eigenfunctions), but there are simple formulas involving 
vr to go from one set to the other. 
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The fact that the rows of P sum to 1 imphes that the eigenvalues of P are between —1 
and 1. To see this note that 

\Pf{x)\ = |5]P(a;,2/)/(y)| 
y 

< (max|/(|/)|) VP(x,y) = max|/(|/)| 
y ^ — ^ y 
y 

Suppose that Pf — Xf. Then, |A||/(x)| < max^^ |/(y)| for all x's. Hence, |A| < 1. 

Finally, the fact that the rows of P sum to one also imply straightforwardly that 1 is an 
eigenvalue of P for constant eigenvectors. 

Example 3.8 (Random walks on weighted graphs). Let G = {V, E; W) be a weighted graph. 
The degree of a vertex is usually the number of neighbors of x, but for a weighted graph it 
can also be defined as 

deg(x) := ^W{x,y) 

Then a (weighted) random walk on G is defined to be the Markov chain with state space V 
and transition matrix 

W(x,y) -f 

— II y X, 



P(x, y) = < 'i'^e^^) 

1^ otherwise. 

In this situation the probability distribution defined at each x &V hy: 



, , deg(x) , ^ sr-^ , , X 
T^\x) := — - — where Z = > deg[z) 



will satisfy 



7i{x)P{x,y) = 



deg(a;) W{x, y) 
Z deg(x) 

^ deg{y) W{y,x) 
Z degiy) 

= Ay)P{y,x)- 

In particular, P is reversible in this case. 

4. Electrical Networks and Effective Resistance 

It is often useful when studying graphs and random walks on graphs to think of them as 
electrical networks. Each edge e then acquires a weight W{e) — C(e) which in this instance 
plays the role of conductance. One recalls from high school physics that the electrical current 
/ through a connection is related to the potential difference V via the formula 

V = RI and / = GV 

where R = 1/G is the resistance and G the conductance. 
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If instead of a simple connection one is looking at a series of connections then simple 
rules allow one to say that the system behaves as if there was only one connection but with 
an appropriately modified resistance. This "virtual" resistance is what we call the effective 
resistance between two nodes. 

4.1. Flows and currents. In order to make this more precise we need to introduce some 
notation. Given two nodes a and h we consider a unit current that is allowed to pass through 
a connected graph with the 'source' at a G and the 'sink' aX h eV . In order to effect this 
passage a certain voltage difference must be applied at a and 6, and a corresponding voltage 
potential will arise at every node x in the graph. Let v[x) denote the voltage at x G V . 

The electric current that runs through the graph is an example of a flow. Mathematically, 
a flow is an assignment of a number j^y (representing its intensity) to every directed edge 
{x,y), that has the property of being antisymmetric, i.e., j{x,y) = —j{y,x). Given two 
nodes a and b, we say j is a flow from a to 6 if the following property is satisfied at every 
node X a,b. 

Kirchoff s Node Law: The flow into a vertex x is equal to the flow out of 
X. In other words, the divergence divj(x) := '^yr^^; j , y) is equal to zero at 
every x ^ a,b. 

Moereover, we also demand that divj(a) > (and thus div j{b) < 0), since it's a flow from 
a to b. The quantity divj(a) is the strength of the flow j from a to b. 

To compute effective resistance we will assume that a unit flow is entering the network at 
a and exiting at b, so divj(a) = — div j{b) — 1 and divj{x) — ior x ^ a, b. 

A flow from a to 6 is furthermore a current flow, if it also satisfles Ohm's law below. In 
this case we use the notation i{x, y). 

Ohm's Law: There is a potential function v deflned on the nodes such that 
for every oriented edge {x,y), R{x,y)i{x,y) = v{y) — v{x) where R{x,y) is 
the resistance of the edge {x, y). 

Whenever a flow can be expressed as the edge difference of a potential function defined on 
the nodes, as is the case for current, then around any cycle Xi ~ • • • ~ x„ ~ Xn+i — X\ the 
following holds necessarily: 



fc=i 

This is known as KirchofF's Potential Law and it turns out to be equivalent to the exis- 
tence of a potential. 



n 
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Combining Kirchoff 's Node Law and Ohm's Law at a; 7^ a, 6, we get: 

= ^i{x,y) 

^ v{y) - v{x) 

= ^ C{x, y)v{y) - C^v{x), 

y^x 

where C{x,y) = 1/R{x,y) is the conductance of the edge {x,y) and = Yly^x^i-'^^v) 
the "degree" of node x when using conductance to weight each edge. In particular, we have 
that 

v{x) = — ^C{x,y)v{y) 

^ y~x 

namely the potential voltage function v must necessarily be harmonic with respect to the 
weighted graph. 

That a current flow satisfying all of these laws and requirements exists is a physical fact, 
however mathematically one has to prove its existence and uniqueness. 

Proving uniqueness is fairly easy because flows that satisfy the two Kirchoff laws mentioned 
above satisfy the Superposition Principle, i.e. given two such flows ii and ^2, then j — ii—i2 
will also satisfy the same laws. 

The existence proof is more challenging and its resolution surprising. A flow can actually 
be constructed explicitly that satisfies all the requirements using the concept of spanning 
trees. 

Probabilistically, the resulting current i{x,y) along an edge {x,y) can be shown to equal 
the expected number of (net) times that the random walker (on the graph weighted by the 
conductances) crosses from x to y. 

4.2. Effective resistance. Given two nodes a and b consider a unit current flow entering at 
a and exiting the network at b. This flow exists and is unique, as explained above, once the 
Neumann conditions, namely the entering and exiting flow, is fixed at a and b. Moreover, 
this current determines a unique voltage potential v at each node (up to a constant). The 
absolute value of the voltage drop between a and b is what we call the effective resistance 
between a and b. In formulas: 

(4.1) 7^efr(a,&) \v{a) - v{b) \ ^ v{b) - v{a). 

4.3. Effective resistance and escape probabilities. Write 

(4.2) C(x):=5]C(x,y), 

yr^x 
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for the local conductance at a node x. Then the random walk on the weighted graph is the 
Markov chain with transition probabihties 

(4.3) P{x,y)--=^ 

C{x) 

whenever y x. 

Effective resistance, or more appropriately, effective conductance, 

(4.4) Ceff(a,6):= 



7^cff(a,6) 

is related to the the probability that a random walker starting at a visits b before returning 
to a. In symbols, recall the notion of stopping time. Assuming that the random walk is at a 
at time 0, we write for the first time it visits node z and r+ for the first time the walker 
revisits a after time 1. We are interested in Pa(Tz < r+), the probability that the random 
walk starting at a visits z before returning to a. 

Proposition 4.1. For any a,b ^V, 

C{a)Fa{n < r+) =Ceff(a,6). 
Proof. The proof is a beautiful application of the maximum principle for harmonic functions 



(Fact 3.4). Consider B = {a,b} to be the boundary and = V \{a,b} the interior. We 
want to find a harmonic function on Q which takes the vale at a and the value 1 at b. By 
the maximum principle, the solution to this problem is unique. We now produce two such 
solutions. 

The first one is, 

h{x) := F^in < r+). 

Harmonicity can be checked by conditioning on the first step. 

The second solution is given by normalizing the voltage function v{x) which is required in 
order to have one unit of current flow in at a and out at b. So let 

9y^> •= ~1T:\ TT = ^eff(^' o){v{x) - v{a)) 



By uniqueness, h = g. 
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Now, by conditional probability, 

^a{n < T+) = P{a, x))F^{n < T+) (first-step analysis) 



J2 ^^f^^"' - ^^^)) i3) 

C[Ctj 

xr^a ^ ' 

^°^/^\ ^ "^{(^7 ^) (Ohm's Law) 

^ divj(a) = \ ^ (by Kirchoff 's node law) 

C{a) C{a) 



□ 



Remark 4.2. The probability p = Pa(Tb < r+) is also known as the escape probability. With 
probability 1 — p escape fails and the walker returns to a before visiting b, at which point 
another identical walker starts out for another attempt. This is akin to flipping a biased coin 
that has probability p of success (Heads). In particular, the number of tosses required to 
achieve success is known to be distributed with the geometric distribution: 

F{N = k) = p{l - p)''-^ for A; = 1,2,3,... 

A calculation using the geometric series shows that the expected number of tosses E(A^) 
equals 1/p. In our context is the number of visits to a before escaping through b (where we 
count t = as a visit) and its expectation is known as the Green's function of the random 
walk started at a and stopped at b. More generally, Gb{a, c) is the expected number of visits 
to c for the walk started at a, before it is stopped at b. It follows from this discussion and 



Proposition 4.1 that Gb{a,a) = C{a) TZcs{a,b). 

4.4. Rayleigh's monotonicity and energy. The energy of a flow j is 

Energy(j):=5^i?(e)(j(e))' 

Notice that for each edge e E E, the quantity j(e) is defined up to a sign change. Therefore, 
the square (j(e))^ is well-defined. 

Proposition 4.3. Consider the unit current flow i from a tob and its corresponding potential 
V defined on the nodes. Given an arbitrary flow k from a to b, 



i?(e)z(e)/c(e) = (f(&) — v{a)) diYk{a). 

ee_E 
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'^^R{e)i{e)k{e) = - R{x,y)i{x,y)k{x,y) (double counting) 

xr^y,x,y^V 

- {v{y) — v{x))k{x,y) (Ohm's law) 

\ ^ 'v{y) k{x, y)-\ ^v{x) ^ k{x, y) {k{x, y) = if x ^ y) 



2 

edE x^y,x,yGV 



yev xev xev yev 



—v{a) divjfc(a) — v{b) divfc(6) (Kirchoff's node law) 

{v{b) - v{a)) divA:(a) (divfe(6) = - divfc(a)) 



□ 



In particular, applying Proposition 4.3 to the case when k equals the unit current flow i 



itself and using (4.1), we flnd that 



(4.5) Energy(i) = {v{b) - v{a)) divi(a) = 7^efr(a, b). 

Theorem 4.4 (Thomson's principle). The unique unit flow from a to b that minimizes 
energy is the unit current flow. 

Proof. Let j be a unit flow from a to 6 and i the unit current flow from a to b. Then k '■= j — i 
is also a flow from a to b, but of strength zero. We have 



Energy(j) = ^i?(e)(z(e) + k{e)f 

ee_E 

= Energy(z) + Energy(A;) + 2 R{e)i{e)k{e) 



e&E 



Using Proposition 4.3 , we see that the cross-term X^eeB -R(e)^(e) A;(e) equals zero since divfc(a) = 
0. Therefore, the energy of j is strictly greater than the energy of i, unless Energy(A;) = 
in which case k = and i = j. □ 



Corollary 4.5. Effective resistance is a (not necessarily strictly) increasing function of the 
edge resistances. 



EFFECTIVE RESISTANCE ON GRAPHS AND THE EPIDEMIC QUASIMETRIC 13 

Proof. Let R{e) < R'{e) for every e G -E, and let i and i' be the corresponding unit current 
flows from a to b. Then 



7^eff = 5^^^(e)(^(e))2 By Q 



< -R(e)(i'(e))^ (By Thomson's principle) 

ee-B 

< -R'(e)(z'(e))^ (By assumption) 



eeS 



Ke By (4.5) 



□ 

Example 4.6. A tree T is a connected graph with no loops. It follows that given two vertices 
X and y on a tree, there is a unique geodesic, i.e., path of minimal length. In this case the 
effective resistance and the graph distance coincide: 

Tiesix,y) = dGix,y), 

because no current can flow along edges that are not part of the unique geodesic. 
Moreover, in a graph G we always have 

(4.6) 7^c^r(a;,?/) < dcix.y). 

To see this, pick a shortest path 7 connecting x to y and note that it will not have any 
loops, since removing an edge along a loop does not affect the connectedness of 7. Then 
complete 7 to a spanning tree T for G, namely a tree on the same N vertices of G which is 



also a subgraph of G. By monotonicity. Corollary 4.5, we have TZes{x,y;G) < TZcsix, y;T), 
because removing an edge from G is equivalent to setting its resistance to be 00. On the 
other hand, since T is a tree 7les{x, y; T) = dxix, y) = ddx, y). 

Proposition 4.7. Effective resistance is a metric. 



Proof. Symmetry follows by reversing the current flow. Non-degeneracy follows from (4.5) 
and the existence of a unit current flow. Finally, the triangle inequality is verifled by noticing 
that inputing a unit current flow at x, extracting it at then reinputing it at z and extracting 
at y is the same as just inputing it at x and extracting it at y. □ 

4.5. Computing effective resistance with matrices. 

4.5.1. Adjacency matrix. The adjacency matrix of a graph is a way to represent which nodes 
of a graph are adjacent to each other. Given a simple graph G = (V, E), let A be the matrix 
with entries: 

AH 7) = l ^ ihj)^E, 
10 otherwise. 
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(We write G E instead of {i,j} G -E by abuse of notation, even though is an 

ordered pair.) 

If the graph is weighted with weight W{i,j), then we set 



otherwise. 



4.5.2. Combinatorial Laplacian. Let G denote a graph with vertex set V = {1,2, ...,A^} 
and edge set E. Then the combinatorial Laplacian L is an x matrix defined by 

{di if«=j, 

-W{t,j) if{t,j)eE 

otherwise 

where di is the degree of the vertex i, i.e. 

di := J2Ait,j). 

j 

Letting D be the diagonal matrix D = Diag((ij), we see that L = D — A. 

Also recalling the transition matrix P for the random walk on G defined in (3.1), we see 
that 

L = D{I -P). 

4.5.3. The square-root of the Laplacian. Given an edge e joining two nodes, arbitrarily assign 
one of these two nodes to be e's tail and the other one to be e's head. Once this choice is 
made we will write e = (x, y) to mean that x is the tail and y the head. 

Consider the following M x N matrix, where M = \E\, and A^ = \V\, 

{1 if X is e's head 
-1 if X is e's tail 
otherwise 

We claim that 
(4.8) L = B^WB 

where W is the M x M diagonal matrix with entries W{e) (the weight of edge e). To check 
this, notice that 

{B^WB){x, y) = J2 B^^^^ e)W{e)B{e, y) = J2 ^)B(^^ ^)^(e)- 

e e 

If X y, the only term that survives in the sum is when x and y are neighbors and e = (x, y), 
then we get —W{x,y). If x = y, any edge from x contributes B{e,xYW{e) = W{e), so we 
get the degree at x. 
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4.5.4. Quadratic Form. A quadratic form is a function of the form Q{x) = Ax, where A is 
a symmetric matrix. Since the combinatorial Laplacian is symmetric we can write 

•'^Lv= ^ v{x)L{x,y)v{y) 



V 

x,yeV 



2 



J2w{e){Bv){er 

e 

J J2 W{x,y){v{x) -v{y)f 



2 

It follows that the quadratic form associated to the Laplacian is positive semi-definite. In 
practice, this implies that the eigenvalues are greater than or equal to zero, as can be seen 
from the so-called "Minimax characterization": let Ai < A2 < ■ ■ ■ < Aat, then 

_ . x^Lx 

Ak = mm max — - — , 

S:DimS=k x±S X^ X 

where S" is a linear subspace of M^. 

Remark 4.8. If v is the potential resulting from inputing a unit current flow i at a and 
extracting it at b, then using Ohm's Law and ( |4.5[ ) we see that the quadratic form satisfies: 



v''Lv = ^ J2 Vr(x,i/)(t;(x)-i;(y))2 = ^i?(e)^(e)2 = Energy(z) =7^efT(a,6). 



{x,y)eE e 

4.5.5. The Kernel of the Laplacian. We claim that the kernel of the Laplacian consists of 
the constant vectors: 

Ker(L) = Ker(iy^5) = Span(l ... l)^ 

We use the quadratic form to check this: 

Lv = v'^Lv = 

^ WW'^BvW^^ = 



J2w{x,y){v{x)-viy)f = 



x,y 



<^==^ v{x) = v{y) V X, y. 

In particular, this means that the smallest eigenvalue of the Laplacian is 0. Traditionally, 
we relabel the eigenvalues from to — 1, as follows: 



(4.9) Ao = < Ai < < A 



N-l- 
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4.5.6. Diagonalizing the Laplacian. Being symmetric, the combinatorial Laplacian L can be 
diagonalized, i.e., there are eigenvectors uq, ui,. . . ,un-i such that 

N-l 



i=l 



Where U = [uq ui - ■ ■ mat-i], mq = (1 ■ ■ ■ 1)"^ and A is the diagonal matrix of eigenvalues, 



A 



'0 

Ai 








.0 ... Atv-i. 

Recall that given a vector m, the matrix uvT' is a rank one matrix with range the line 
spanned by u. 

Since the Laplacian has a non-trivial kernel, it is not invertible. However, we can still 
define a pseudoinverse called the Green operator. 

N-l 



y- 



Then Ker(^) = Ker(L) and 



N-l 



Lg = gL = J2 



i=l 



which is also equal to the projection in M onto Span(Mi, . . . , un-i)- 

4.5.7. Kirchoff's and Ohm's Law revisited. Let lext denote the current injected at point a 
and extracted at point b, which can be thought of as an x 1 vector 

(0 ... \Ia\ ... -\h\ ... 0)^. 

The current i{e) for each edge e, can be written as an M x 1 vector since there are M edges. 
If the edge e is assigned a head and a tail, then Kirchoff's node law can be written in matrix 
form as 



(4.10) 



where B is the square-root of the Laplacian defined in (4.7). 
To see this check that at every vertex x, 

Iext{x) = '^B'^{x,e)i{e) = ^B{e,x)i{e). 

The only terms that survive in this sum correspond to edges e with either their head or tail 
at X. Since we are free to choose the heads and tails of each edge, let's do so in such a way 
that current always fiows from head to tail. Then B{e, x) = 1 for every edge with an outfiow 
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of current and B{e,x) = —1 for every edge with an inflow of current (and i{e) is always 
positive with this choice of orientation). So the sum we get is exactly the divergence at x. 
Ohm's Law says that the resulting voltage v on the network satisfies 

R[x,y) 

In matrix form Ohm's Law can be written as: 
(4.11) i = WBv, 

as the following computation shows: 



iWBv){e) = y W{e)B{e, z)v{z) = W{e){v{y) - v{x)) 



where e = {x, y), since y is the head and x the tail of e. 
Combining ( |4.10[ ) and ( |4.11[ ) with ( [48| , we see that 



lext = B^WBv = Lv. 

We interpret this as an inhomogeneous problem to solve in v with I^^t given. However, as 
we've seen L is not generally invertible, unless I^xt is perpendicular to the kernel of L, i.e., 
Span (l ... l)^. That's exactly our case, since the current input at a equals the current 
output at b. Therefore, we can apply Green's operator to both sides and get the potential 
drop 

V = QLV = Qlext- 

In other words, writing 

7^,,:= (0 ... 1 ... -1 ... = hxu 

we get that the effective resistance 

7^cff(a, h) = v{b) - v{a) = r/J^w = vlb^Vab- 

If a and b are neighbors, then r/J^ is a row of B corresponding to the edge e = (a,b), and 
thus 

(5^i?^)(e,e) = i?e//(e). 

5. The epidemic distance on graphs 

5.1. The epidemic distance. Given an infected individual at node x the disease can be 
passed on to any of its neighbors and then the process repeats. To model this dynamic we 
introduce the metric balls centered at x: 

B{x,n) := {y eV : dcix.y) < n} 



where dc is the graph-distance defined in 2.2 and n = 1, 2, 3 
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Given two nodes x and we define the epidemic distance between them to be the size of 
the part of the graph that would be affected (the potential damage) if an epidemic started 
at X and reached or vice versa. In formulas, we consider all the vertices in 

B{x,dG{x,y)) 

and form the induced subgraph of G which we call VL{x,dG{x,y)). We then compute the 



volume \Vt,[x, dcix, y))\ as in Section 2.1, by counting the number of edges. So we define the 



epidemic distance between x and y to be 

(5.1) EpidemicDist(a;,?/) := \Q{x,dG{x,y))\ + \Q{y,dG{x,y))\. 

The epidemic distance is not a distance in the mathematical sense. For instance, the 
triangle inequality can fail as badly as possible, as the following example shows. 

Example 5.1. In Figure [T} 

EpidemicDist(xi, 0:2) = 3 and EpidemicDist(x2, Xa) = 4 

while 

EpidemicDist(xi, X3) = N + 5. 

So the triangle inequality in this case can be made to fail as badly as needed by letting 
increase. 



Xi Xo 




Figure 1. Failure of the triangle inequality 



For this reason we've played with the idea of calling it "epidemic damage" instead. 

5.2. Computing the epidemic distance. We wrote a simple routine in Matlab (actually 
we wrote a couple different ones). We are given a simple (possibly weighted graph) in the 
form of an adjacency matrix, see Section 4.5.1 In some cases, matrices are too large and 
have to be entered in list form. This consists of two columns of labels from 1 to A^, where 
each row represents an edge in the graph. 
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We call the adjacency matrix K and normalize it so that all the non-zero entries are equal 
to 1. The matrix K is zero along the diagonal so welei B — K -\- 1. We then take powers 
oi B. An entry in the (i, j) spot of B^ is non-zero if and only if at least one of the terms 
in the sum 

B{t,h)B{h,l2)---B{k,j) 

h,...,lk 

is non zero, i.e., if and only if there is at least one chain (i, li), . . . , {lk,j) connecting i and j. 
Each step {Is, h+i) either moves to a neighbor or stays at the given node. We again normalize 
the entries of B'' to be 1 when non-zero. Let B^ be the normalized version of B'' and define 
Bo^L 

Next, given a node t we look at the row t in the matrix B^ and find a 1 in the column 
corresponding to each node that is graph distance less or equal to k from t. We use this row 
to form a diagonal matrix M and compute the matrix ^ := MKM. The entries of Kt^^ 
are of the form 

(6.2) K,,(r,,) = M(,. im,j)MU,j) = { I " " ^ 

In other words, we see that Kf^k is the adjacency matrix of the subgraph of G induced by the 
vertices that are in the ball B{t, k) centered at t of radius k. We now compute the volume 
by summing all the entries of Kt^k and dividing by 2. This is equivalent to first summing all 
the rows and getting the local degrees and then summing the rows and getting the sum of 
all the local degrees, which by the Handshake Lemma equals twice the number of edges. 

Finally, recall that the graph distance between x and y is the first time y belongs to the 
hail B{x,k): 

dci^^y) = nim{k : Bk{x,y) = 1}. 

The way we actually compute the graph distance is as follows. We flip each zero and one 
in Bk, in practice we take a constant matrix O with entries 0{x,y) = 1 and let Ck{x,y) — 
0{x, y) — Bk{x, y). Then we sum the from to N, i.e., we let D :— Y^^=q ^k- Then 

dG(x,y)-l 

D{x,y)^ ^ l^daix^y). 

k=0 

6. Concepts imported from function and potential theory 

6.1. Capacity. A function u defined on the vertices induces a gradient Pu on the edges, i.e., 
for e = {x,y}, 

Pu{e) := \u{x) - u{y)\. 
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The energy of p„ is 

(6.1) £:(p„) = 5^p.(e)l 

(On a weighted graph we modify this definition as follows: £{pu) = J2e£E Pui^)'^'^{^)-) 

Given two nodes a and 6 in a graph G, we will minimize the energy among all the functions 
M : — )■ M with u{a) = and u{b) = 1. We define the capacity between a and b to be 

Cap(a, 6)= min £(Pu)- 

u{a)=0,u{b)=l ^ 

The function u that attains the minimum is called the capacitary function for a and h. 
Assuming that each edge has unit resistance, recall that Thomson's principle (Theorem 



4.4) and (4.5) imply that the effective resistance between a and h can be computed by 



minimizing the energy of all unit flows between a and 6, and that the minimum is achieved 
for the unit current flow. The electric potential v that gives rise to the unit current flow can 
be used to interpret the unsigned flow as a "gradient" . In fact, by Ohm's Law, for e = {x, y}, 

\i{e)\ = \v{x) -v{y)\. 

The electric potential v is only defined up to an additive constant, what matters is the drop 
\v{a) — f (6)1 which equals 7?.efr(a,6). 
So, dividing v by 7les{a, h) and shifting by a constant C if necessary, we get a function 

such that U{a) = 0, U{b) = 1, and whose gradient is 

, , \i(e)\ 
Pu{e) 



TZefi{a,b)' 



Computing the energy of pu and using (4.5) we get an upper bound for Cap(a, b): 

Cap(a, b) < ] = Ceff(a, b). 

This inequality is in fact an equality, even though the two sides are obtained by minimizing 
the energy of very different objects: flows that don't necessarily admit potentials, on one 
hand, and gradients which are obtained from a "potential" function defined on the vertex 
set, on the other hand. 



Proposition 6.1. We always have 

Cap (a, 6) = Ces(a,6), 
and the function U defined in (6.2) is the capacitary function for a and b. 
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Proof. The quadratic form induced by the combinatorial Laplacian of Section 4.5.4 can be 
used to compute the energy of gradients, since 

u^Lu= ^ \u{x) -u{y)\^ = £{pu). 

{x,y}eE 

In other words, if we number the nodes 1, ■ ■ ■ , A^, the capacity Cap(z, j) for z < j is computed 
by minimizing the quadratic form restricted to the affine subspace of codimension 2 

Aij = {x = (xi, . . . ,Xn) G : Xi = 0, Xj = 1}. 

This can be handled with the method of Lagrange multipliers. Let gi{x) = Xi for i = 1, . . . , 
and /(x) = x^Lx. Then given i < j, 

(6.3) V/ = X^Vg^ + XjVgj 

for some parameters Aj and Xj. 

Notice that Vgi = Cj, the standard unit vector in the i-th direction. Also V/(x) = 2Lx. 



Let w = {wi, . . . ,Wn) be a solution of (6.3). Then, interpreted as a function, w is harmonic 
at every node except possibly for nodes i and j. Moreover, Wi = and Wj = 1. By 
the maximum principle, there is a unique harmonic function with these boundary values. 



Therefore the solution to the Lagrange multipliers problem coincides with U from (6.2), i.e., 
the harmonic function obtained by renormalizing the electric potential v arising from the 
effective resistance problem. □ 

6.2. Modulus of path families. We relax the notion of gradient and consider arbitrary 
p-metrics, i.e., functions p : E ^ [0, +oo). The p-length of a path 7 is then: 

(6.4) Length^(7) = ^p(e). 

££7 



We measure the energy of p as done before in (6.1) for gradients. 

A path family F is a collection of paths 7 in a graph G. We say that p is admissible for the 
path family F if 

(6.5) Length^(7) > 1 V7 G F. 

We write A for the family of all admissible metrics for a given path family F. 
The modulus of F is 

(6.6) Mod(F) = inf ^(p). 

The advantage of modulus is that any choice of admissible metric gives rise to an upper- 
bound. If the family F contains a constant path, then its modulus is infinite. Otherwise, 
choosing p = 1 we see that Mod(F) is bounded above by the number of edges. 
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Given two nodes a and 6, let Mod(a, h) be the modulus of the path-family consisting of 
all paths 7 that contain both a and h. We call this path-family r(a,6). 

Proposition 6.2. 

dcia, bY Mod(a, b) < EpidemicDist(a, b) 

Proof. Recall some of the notations introduced to define the epidemic distance. We consid- 
ered Q{a,dG{a,b)), i.e., the subgraph of G induced by the vertices that are in the ball 
B{a,dG{a,b)). Define a p-metric for T{a,b) by letting p(e) = l/dG{a,b), if e is an edge in 
Q := Q{a,dG{a,b)) U Q{b,dG{a,b)), and let p(e) = otherwise. We claim that this p-metric 
is admissible for r{a,b). To see this pick an arbitray path 7 that contains both a and b. 



By definition of graph distance (2.2), we have Length(^(7) > dG{a,b), and this takes care of 
paths that stay in Q. Some paths might actually exit Q, but if they do then they must exit 
the ball B{a, dG{a, b)) and therefore they will again have p-length greater than one. 



By (6.6) we get that 

Mod(a,6)< 



dGia^b)"^ 

□ 

It turns out that modulus and capacity are closely related concepts. 

Proposition 6.3. We always have 

Mod(a, 6) = Cap(a, 6). 

Proof. Let U be the capacitary function for a and b defined in (6.2), whose gradient pu has 
energy £{pu) equal to Cap (a, b). We first show that pu is an admissible p-metric for r(a, b). 
Let 7 be an arbitrary path from a to b. Then, since 7 is connected, it must contain a chain 
of vertices Xq = a, Xi, . . . , = & so that Xj ~ x^+i for j = 0, . . . , m — 1. By the triangle 
inequality: 

m— 1 



l = \U{a)-U{h)\ = \Y,iU{x,+,-U,) 

j=0 

m~l 

<Y,mx,+i-u,\ 

j=0 
m—1 



j=0 
ee7 
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So pu is admissible for r(a, h) and 

Mod(a,6) < £{pu) = Cap(a,fe). 

Conversely, let p be an arbitrary admissible metric for r{a,b). Define a function u on the 
vertices by 

u{x) = min Length (7), 

i.e., the shortest p-length of paths connecting a to x. 

Then u{a) = (the constant path has length zero), and u{b) > 1 by admissibility of p. 
Furthermmore, for an arbitrary edge e = {x,y}, 

u{y) < u{x) +p(e), 

because the shortest path from a to x followed by the edge e is a path from a to y. Therefore, 
inverting the roles of x and y, we find that the gradient of u satisfies 

pu{e) = \u{x) -u{y)\ < p(e). 

This in turn implies that £{pu) < £{p)i i-e., 

Cap(a,6) < £{p). 

Since p was an arbitrary admissible metric, we can minimize over p and get 

Cap(a,6) < Mod(a,6). 

□ 



Putting Propositions |6.1| and |6.3| together we obtain: 
Theorem 6.4. The three concepts of effective conductance, capacity and modulus coincide: 

Ces{o,,b) = Cap(a, 6) = Mod(a,6). 



We can now exploit the definition of modulus as an infimum, as in Proposition 6^, and 
obtain a comparison between epidemic distance and effective conductance. 

Corollary 6.5. We have 

(6.7) EpidemicDist(a, 6) > ^^(a, 6)^ Ccfr(a, &) = j. . 
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7. Explicit and numerical computations 



7.1. Some closed form expressions. We begin by calculating some of these quantities 
exactly, for special cases or families of cases. 

Example 7.1 (Path graphs). For each N ^ N, P/v is the unique graph on N vertices that 
can be labeled xi, . . . ,xn so that Xj ~ Xj+i for j = 1, . . . , N — 1. 
The diameter of Pn is — 1. 

Being a tree, effective resistance on P„ is equal to the graph distance, i.e., for i < j 

TZcsixi, Xj) = dcixi, Xj) = j - i. 

On the other hand, 

EpidemicDist(xj, Xj) = min{j — i,i — 1} + 2(j — i) + min{j — i,N — j} 



To study how much is lost in (6.7) we consider the "discrepancy": 

7^eff(a5 EpidemicDist(a, 6) 
dG{a,b)^ 

In the case of the path graph Pj^, we have 

2 < 5(xi, X,) = min j 1, 1 + 2 + min j 1, — — ^ 1 < 4. 

Example 7.2 (The star and complete graphs). The ratio 6 can grow when the diameter of 
a tree is very small. 

For instance, the star Sn is a tree with a vertex xq that plays the role of hub and has 
— 1 neighbors. The diameter is 2 and the graph distances are either 1 or 2. On the other 
hand, the epidemic distance is either N or 2(A^ — 1). Therefore 6 grows linearly with N. 

The complete graph K]\f is the graph on vertices with the maximum number of edges, 
i.e., every vertex has — 1 neighbors. Here the diameter is 1, and the graph distance is 
constant equal to 1. Effective resistance can be computed as follow: fix two nodes x and y 
and let V be the remaining nodes in \ {x,y}. By symmetry the potential induced by a 
unit current flow between a and b will be constant on V. Hence no current will flow along 
the Kn-2 complete graph induced by V . We get that V can be thought as a single vertex 
connected by A^ — 2 edges to x and y respectively. Using the "serial" and "parallel" rules 
from high school physics, we find that 

2 

^efF(a;,l/) = — . 

The epidemic distance in this case is constant and equal to A^(A^ — 1). Therefore 5 is also 
constant and equal to (A^ — l)/2. Again we see that 5 grows linearly with A^. 



EFFECTIVE RESISTANCE ON GRAPHS AND THE EPIDEMIC QUASIMETRIC 25 
One might conjecture that the discrepancy 6 always grows at most hnearly in the number 



of vertices, but that turns out to be false. Using (4.6), the volume of the complete graph 
and bounding dG{a,b) > 1, we find that 6{a,b) < N"^. This quadratic worst behavior can 
be achieved by letting G be the graph obtained from the complete graph Kj^j by picking a 
vertex a and connecting it to a new vertex 6 by a single edge. In this situation dda, b) = 1, 
and 7^eff(a,&) = 1, but EpidemicDist(a, 6) = A^(A^ - l)/2 + 2, so 6{a,b) = 0{N^). One 
could say that in this case epidemic distance captures a feature of the graph G that effective 
resistance does not see. 

7.2. Numerical experiments using epidemic distance in clustering algorithms. 

Epidemic distance carries more information for pairs of nodes that are close relative to the 
diameter of the graph. 

In order to write the Matlab code and check it against a concrete graph it was really useful 
to have a very simple object such as the one in Figure [2] 




Figure 2. A simple graph 

Once we wrote the algorithm for computing the epidemic distance we tested how it would 
fare in the case of the famous " Karate Club" , see Figure [3j 

We used the epidemic distance as a measure of "similarity" between nodes in the ag- 
glomerative AGNES algorithm (this we performed in R). We obtained the dendrogram in 
Figure |4} 

Using the two largest clusters we obtained a cut for the Karate club that mislabels only 
three vertices as can be seen in Figure [5j 
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Figure 3. The original split in the Karate Club 
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Dendrogram of agnes(x = hotdogkarate, diss = T) 



hotdogkarate 
Agglomerative Coefficient = 0.78 



Figure 4. AGNES produced this dendrogram 




Figure 5. Epidemic cut of the Karate Club 



